A framework for systematic promoter motif discovery and expression profiling from high dimensional brain transcriptome data
Authors
Jeremy A. Lieberman
Abstract
Understanding the regulatory logic of genes across discrete brain substructures can elucidate the basis of neural network connectivity and the causes of disease. In particular, promoter motifs that govern high- or low-expression gene networks present an important fulcrum for phenotypic behavior. Using the Allen Institute Brain Atlas, we applied several clustering approaches to find closely regulated genes and generated substructure-specific expression profiles to run through FIRE, a motif discovery algorithm, and iPAGE, a functional ontology algorithm. Notably, we found a single large cluster of genes with tightly coordinated behavior across hundreds of brain substructures and a unique upstream promoter signature, yet highly diverse ontological characteristics. We also present a BRain EXpression Profile ASSembly script (BEXPASS) whose output is customized for FIRE and iPAGE input. Lastly, we look at the language processing and speech control areas of the brain and put forward recommendations for promoters that can serve as part of DNA constructs for optogenetic research, an emerging neuroscientific method that uses the microbial light-gated ion channel protein channelrhodopsin (ChR1 or ChR2) as a tool to activate neural pathway signaling.
Acknowledgements
I would like to thank the members of Professor Saeed Tavazoie’s lab for supporting my research. In particular, I thank Saeed Tavazoie and Panos Oikonomou for their mentorship and guidance. I also thank Peter Freddolino for his expertise in the R language.
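The BEXPASS script itself is not reproduced on this page, so the following is only a rough sketch of what a profile-assembly step of this kind might look like, not the author's implementation. It assumes that a suitable input is a two-column, tab-delimited file with a header row, holding a gene ID and a discrete expression bin for one brain substructure. The function names `assemble_profile` and `write_profile`, the equal-population binning, and the toy gene values are all hypothetical and chosen for illustration.

```python
import csv
import io


def assemble_profile(expression, n_bins=3):
    """Rank genes by expression and split them into equal-population bins.

    expression: dict mapping gene ID -> expression value in one substructure
    (hypothetical input shape). Returns (gene_id, bin_index) pairs, where
    bin 0 holds the lowest-expressed genes.
    """
    # Sort by value, then by gene ID so that ties are ordered deterministically.
    ranked = sorted(expression, key=lambda g: (expression[g], g))
    n = len(ranked)
    return [
        (gene, min(i * n_bins // n, n_bins - 1))
        for i, gene in enumerate(ranked)
    ]


def write_profile(pairs, handle, header=("gene", "bin")):
    """Write the pairs as a tab-delimited table with one header row.

    The two-column layout is an assumption about what a downstream
    motif or ontology tool would accept.
    """
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(header)
    writer.writerows(pairs)


# Toy expression values for a single substructure (invented for illustration).
toy = {
    "GeneA": 0.2, "GeneB": 5.1, "GeneC": 1.3,
    "GeneD": 9.8, "GeneE": 0.7, "GeneF": 3.3,
}
pairs = assemble_profile(toy, n_bins=3)
buf = io.StringIO()
write_profile(pairs, buf)
print(buf.getvalue())
```

Binning by rank rather than by raw value keeps the bins equally populated regardless of how skewed the expression distribution is. Whether discrete bins or continuous values suit a given analysis better is a separate design choice that this sketch does not address.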
Related resources
Network-based transcriptome analysis in salt tolerant and salt sensitive maize (Zea mays L.) genotypes
Identification of genes involved in salinity stress tolerance provides deeper insight into molecular mechanisms underlying salinity tolerance in maize. The present study was conducted in the faculty of agriculture of Urmia university, Iran, in 2018, with the aim of identifying genetic differences between two maize genotypes in tolerance to salinity stress, and the results of gene expression wer...
iMoMi (interactive Motif Mining) - a database and utilities to assist the discovery of new regulatory patterns
Detection of DNA binding motifs for regulatory proteins allow to assign with a good reliability the role of each regulator in the cellular metabolism. With the increasing amount of complete genome sequences and the use of transcriptome analysis methods, bioinformatics approaches should contribute to detect most potential regulatory motifs that biologists will be able to confirm by biochemistry ...
DaMiRseq-an R/Bioconductor package for data mining of RNA-Seq data: normalization, feature selection and classification.
Summary: RNA-Seq is becoming the technique of choice for high-throughput transcriptome profiling, which, besides class comparison for differential expression, promises to be an effective and powerful tool for biomarker discovery. However, a systematic analysis of high-dimensional genomic data is a demanding task for such a purpose. DaMiRseq offers an organized, flexible and convenient framework ...
cWords - systematic microRNA regulatory motif discovery from mRNA expression data
Background: Post-transcriptional regulation of gene expression by small RNAs and RNA binding proteins is of fundamental importance in development of complex organisms, and dysregulation of regulatory RNAs can influence onset, progression and potentially be target for treatment of many diseases. Post-transcriptional regulation by small RNAs is mediated through partial complementary binding to mes...
Heterogeneous data fusion for brain tumor classification.
Current research in biomedical informatics involves analysis of multiple heterogeneous data sets. This includes patient demographics, clinical and pathology data, treatment history, patient outcomes as well as gene expression, DNA sequences and other information sources such as gene ontology. Analysis of these data sets could lead to better diseas...